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ON THE LIMITING DISTRIBUTIONS OF MULTIVARIATE 
DEPTH-BASED RANK SUM STATISTICS AND RELATED TESTS 

By Yijun Zuo 1 and Xuming He 2 

Michigan State University and University of Illinois 

A depth-based rank sum statistic for multivariate data intro- 
duced by Liu and Singh [J. Amer. Statist. Assoc. 88 (1993) 252-260] 
as an extension of the Wilcoxon rank sum statistic for univariate 
data has been used in multivariate rank tests in quality control and 
in experimental studies. Those applications, however, are based on 
a conjectured limiting distribution, provided by Liu and Singh [J. 
Amer. Statist. Assoc. 88 (1993) 252-260]. The present paper proves 
the conjecture under general regularity conditions and, therefore, val- 
idates various applications of the rank sum statistic in the literature. 
The paper also shows that the corresponding rank sum tests can be 
more powerful than Hotelling's T 2 test and some commonly used mul- 
tivariate rank tests in detecting location-scale changes in multivariate 
distributions. 

1. Introduction. The key idea of data depth is to provide a center- 
outward ordering of multivariate observations. Points deep inside a data 
cloud are assigned high depths, while those on the outskirts are assigned 
lower depths. The depth of a point decreases when the point moves away 
from the center of the data cloud. Applications of depth- induced ordering 
are numerous. For example, Liu and Singh [12] generalized, via data depth, 
the Wilcoxon rank sum statistic to the multivariate setting. Earlier general- 
izations of the statistic are due to, for example, Puri and Sen [17], Brown and 
Hettmansperger [2] and Randies and Peters [19]. More recent ones include 
Choi and Marden [3], Hettmansperger, Mottonen and Oja [7] and Topchii, 
Tyurin and Oja [23]. A special version of the Liu-Singh depth-based rank 
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sum statistic (with a reference sample) inherits the distribution-free prop- 
erty of the Wilcoxon rank sum statistic. The statistic discussed in this paper, 
like most other generalizations, is only asymptotically distribution-free under 
the null hypothesis. For its applications in quality control and experimen- 
tal studies to detect quality deterioration and treatment effects, we refer to 
[10, 11, 12]. These applications relied on a conjectured limiting distribution, 
provided by Liu and Singh [12], of the depth-based rank sum statistic. Rous- 
son [20] made an attempt to prove the conjecture, but did not handle the 
differentiability of the depth functionals for a rigorous treatment. The first 
objective of the present paper is to fill this mathematical gap by providing 
regularity conditions for the limiting distribution to hold and by verifying 
those conditions for some commonly used depth functions. Empirical process 
theory and, in particular, a generalized Dvoretzk-Kiefer-Wolfowitz theorem 
in the multivariate setting, turns out to be very useful here. 

Our second objective is to investigate the power behavior of the test based 
on the Liu-Singh rank sum statistic. The test can outperform Hotelling's 
T 2 test and some other existing multivariate tests in detecting location-scale 
changes for a wide range of distributions. In particular, it is very powerful 
for detecting scale changes in the alternative, for which Hotelling's T 2 test 
is not even consistent. 

Section 2 presents the Liu-Singh depth-based rank sum statistic and an 
asymptotic normality theorem. Technical proofs of the main theorem and 
auxiliary lemmas are given in Section 3. The theorem is applied to several 
commonly used depth functions in Section 4, Section 5 is devoted to a study 
of the power properties of the rank sum test. Concluding remarks in Section 6 
end the paper. 

2. Liu Singh statistic and its limiting distribution. Let X ~ F and Y ~ 

G be two independent random variables in M. d . Let D(y;H) be a depth 
function of a given distribution H in M rf evaluated at point y. Lin and 
Singh [12] introduced R(y; F) = P F (X : D(X; F) < D(y; F)) to measure the 
relative outlyingness of y with respect to F and defined a quality index 



Since R(y; F) is the fraction of the F population that is "not as deep" as 
the point y, Q(F, G) is the average fraction over all y £ G. As pointed out by 
Proposition 3.1 of Liu and Singh [12], R(Y;F) ~ U[0, 1] and, consequently, 
Q(F, G) = 1/2 when Y ~ G = F and D(X; F) has a continuous distribution. 
Thus, the index Q(F, G) can be used to detect a treatment effect or quality 




(2.1) 



P{D(X; F) < D(Y; F)\X ~ F,Y ~ G}. 
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deterioration. The Liu-Singh depth-based rank sum statistic 




is a two-sample estimator of Q(F,G) based on the empirical distributions 
F m and G n . Under the null hypothesis F = G (e.g., no treatment effect or 
quality deterioration), Liu and Singh [12] proved in one dimension d = 1 
that 

(2.3) ((1/m + l/n)/12)~ 1/2 (Q(F m , G n ) - 1/2) ^ N(0, 1) 

and in higher dimensions, they proved the same for the Mahalanobis depth 
under the existence of the fourth moments, conjecturing that the same lim- 
iting distribution holds for general depth functions and in the general mul- 
tivariate setting. In the next section, we prove this conjecture under some 
regularity conditions and generalize the result to the case F ^ G in order to 
perform a power study. 

We first list assumptions that are needed for the main result. They will 
be verified in this and later sections for some commonly used depth func- 
tions. Assume, without loss of generality, that m < n hereafter. Let F m 
be the empirical version of F and D(- ;■) be a given depth function with 
< D(x; H) < 1 for any point x and distribution H in M. d . 

(Al) P{ yi < D(Y;F) < y 2 } < C\y 2 - yi\ for some C and any Vl ,y 2 £ [0, 1]. 
(A2) sup xgK d \D(x; F m ) — D(x; F)\ = o(l), almost surely, as m — > oo. 
(A3) E(sup xeWLd \D(x;F m ) - D(x; F)\) = 0{m~ 1 ' 2 ). 

(A4) E(Y /i Pix(F m )piY(F m )) = o(m" 1/2 ) if there exist q such that pix(F m ) > 
and p lY {F m ) > for p iZ (F m ) := P{D{Z;F m ) = a | F m ), % = 1, 2, . . . . 

Assumption (Al) is the Lipschitz continuity of the distribution of D(Y; F) 
and can be extended to a more general case with \x 2 — x\\ replaced by 
\%2 — x i\ a for some a > 0, if (A3) is also replaced by E (sup xGK d \D(x; F m ) — 
D(x;F)\) a = 0{m- a / 2 ). The following main result of the paper still holds 
true: 

Theorem 1. Let X ~ F and Y ~ G be independent and X\, . . . ,X m 
and Yi,...,Y n be independent samples from F and G, respectively. Under 
(A1HA4), 

(a 2 GF /m + a 2 FG /n)- 1/2 (Q(F m , G n ) - Q(F, 67)) N(0, 1), as m oo, 
where 

*fg = J P\D(X; F) < D(y, F)) dG(y) - Q 2 (F, G), 
°GF = J P\D{x;F) <D{Y,F))dF{x) -Q 2 {F,G). 
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Assumption (A2) in the theorem is satisfied by most depth functions such 
as the Mahalanobis, projection, simplicial and halfspace depth functions; see 
[9, 15, 25] for related discussions. Assumptions (A3)— (A4) also hold true for 
many of the commonly used depth functions. Verifications can be technically 
challenging and are deferred to Section 4. 



Remark 1. Under the null hypothesis F = G, it is readily seen that 
Q(F, G) = 1/2 and a 2 GF = a 2 FG = 1/12 in the theorem. 

Remark 2. Note that (Al)-(A4) and, consequently, the theorem hold 
true for not only common depth functions that induce a center-outward 
ordering in M. d , but also other functions that can induce a general (not nec- 
essarily center-outward) ordering in M. d . For example, if we define a function 
D(x,F) = F(x) in M 1 , then the corresponding Liu-Singh statistic is equiv- 
alent to the Wilcoxon rank sum statistic. 



3. Proofs of the main result and auxiliary lemmas. To prove the main 
theorem, we need the following auxiliary lemmas. Some proofs are skipped. 
For the sake of convenience, we write, for any distribution functions H, F\ 
and i*2 in M. d , points x and y in R d and a given (affine invariant) depth 
function D (•;•), 

I(x,y,H)=I{D(x;H)<D(y;H)}, 
I(x,y,F 1 ,F 2 )=I(x,y,F 1 )-I(x,y,F 2 ). 

Lemma 1. Let F m and G n be the empirical distributions based on inde- 
pendent samples of sizes m and n from distributions F and G, respectively. 
Then 

(i) ff I(x, y, F) d(G n (y) - G(y)) d{F m {x) - F{x)) = O p {\/yffm), 

(ii) JJI(x,y,F m ,F)d(F m (x)-F(x))dG(y) = o p (l/^h-i) under (A1)-(A2) 

and 

(iii) ff I(x,y,F m ,F)dF m (x)d(G n -G)(y) = Opim-^n- 1 / 2 ) under (Al) 
and (A3). 

Proof. We prove (iii). The proofs of (i)-(ii) are omitted. Let I mn := 
ffI(x,y,F m ,F)dF m (x)d(G n -G)(y). Then 



E(I„ 



<E 



E 



J I(x,y,F m ,F)d(G n -G)(y) 
J I(X 1 ,y,F m ,F)d(G n -G)(y) 



dF m (^x / 
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E E^p^I(Xx,Yi,F m ,F) 



- EyI(Xi,Y, F m ,F)) 

1 



X\,...,X n 



< -E[E Y {(I(Xt, Fi, F m ,F)y\X u . . .,X m }} 
n 

= -E[E Y {\I(X 1 ,Y 1 ,F m ,F)\\X 1 ,...,X m }}. 
n 

One can verify that 

\I(x,y,F m ,F)\<l(\D(x;F)-D(y;F)\<2sn V \D(x;F m )-D(x;F)\). 

By (Al) and (A3), we have 

E(Imn) 2 < —E ( sup \D(x; F m ) - D(x; F)\) = 0(1/ \m l l 2 n)). 

The desired result then follows from Markov's inequality. □ 

Lemma 2. Assume that X ~ F and Y ~ G? are independent. Then under 
(A4), we have JJ I(D(x; F m ) = D(y; F m )) dF{x) dG(y) = aim- 1 / 2 ). 

Proof. Let I(F m ) = JJI(D(x; F m ) = D(y; F m ))dF(x) dG(y). Condition- 
ally on Ai, . . . , X m (or equivalently on F m ), we have 

I(F m )= P(D(X;F m )=D(y; 

Fm) | F m )dG(y) 

J{V.P(D(X;F m )=D(v;F m )\F m )>0} 
= E / P(D(X; F m ) = a | F m ) dG{y) 

i J{y- P(D(X;F m )=D(y; F m )=c t |F m )>0} 

= P (D(X; F rn ) = a | F m )P{D{Y- F m ) = a \ F m ) 

i 

= ^}2Pix(Fm)PiY(F m ), 

i 

where < a < 1 such that P(D(X;F m ) = a \ F m ) = P(D(Y;F m ) = a | 
F m ) > 0. (Note that there are at most countably many such Cj's.) Taking ex- 
pectation with respect to X\ , . . . , X m , the desired result follows immediately 
from (A4). □ 

Lemma 3. Let X ~ F and Y ~ G be independent and let X\, . . . ,X m 
and Y±, . . . ,Y m be independent samples from F and G, respectively. Under 
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(A1)-(A4), 

Q(F m ,G n )-Q(F,G n ) = J J I(x,y,F)dG{y)d(F m {x)-F{x)) + o p ( 

and, consequently, \^rn(Q(F m , G n ) — Q(F, G n )) N(0, o-q F ). 

Proof. It suffices to consider the case F = G. First, we observe that 
Q(F m , G n ) - Q(F, G n ) = J R(y- F m ) dG n (y) - J R(y; F) dG n (y) 

= j J I(x,y,F m )dF m (x)dG n (y) 
- II I(x,y,F)dF(x)dG n (y) 

= 11 [I(x,y,F m )-I(x,y,F)}dF m (x)dG n {y) 

+ 11 1(x,y,F)d(G n (y)-G(y))d(F m (x)-F(x)) 
+ [[ I(x,y,F)dG(y)d(F m (x) - F(x)). 



We shall call the last three terms I mn i I mn 2 and I m 3, respectively. From 
Lemma 1, it follows immediately that y/rnl mn 2 — o p (l). By a standard cen- 
tral limit theorem, we have 

(3.4) ^j m3 ^iV(0,4 F ). 
We now show that s/ml mn \ = o p (l). Observe that 

W = / j I(x,y,F m ,F)dF m (x)d(G n -G)(y) 
+ 11 1(x,y,F m ,F)dF m (x)dG(y) 

= H I(x,y,F m ,F)dF(x)dG(y)+o p (l/^), 
by Lemma 1 and the given condition. It is readily seen that 
1 1 1(x,y,F m ,F)dF(x)dG(y) 

= H I(x,y,F m )dF(x)dG(y)- J j I(x,y, F) dF(x) dG(y) 

= yj[I(D(x,F m )<D(y;F m )) 

+ I(D(x, F m ) > D(y- F m ))\ dF(x) dG(y) - \ 
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= jl(D{x,F m ) = D{y;F m ))dF{x)dG{y) = o(m^ 2 ), 
by Lemma 2. The desired result follows immediately. □ 

Proof of Theorem 1. By Lemma 3, we have 
Q(F m ,G n ) - Q(F, G) 

= (Q(F m ,G n ) - Q(F, G n )) + (Q(F, G n ) - Q(F, G)) 

= JJ I(x,y,F)dG(y)d(F m (x)-F(x)) 

+ JI I(x,y,F)dF(x)d(G n (y) - G(y)) + o p (m-V 2 ). 

The independence of F m and G n and the central limit theorem then give 
the result. □ 

4. Applications and examples. This section verifies (A3)-(A4) [and (A2)] 
for several common depth functions. Mahalanobis, halfspace and projection 
depth functions are selected for illustration. The findings here and in Sec- 
tion 2 ensure the validity of Theorem 1 for these depth functions. 

Example 1 [Mahalanobis depth (MHD)]. The depth of a point x is 
defined as 

MHD(x;F) = 1/(1 + (x - ^(F)) / E" 1 (F)(x - fi(F) j), x G R d , 

where fJ>(F) and S(-F) are location and covariance measures of a given dis- 
tribution F; see [12, 27]. Clearly, both MHD(x;F) and MHD(x;F m ) van- 
ish at infinity as ||x|| — ► oo, where F m is the empirical version of F based 
on Xi, . . . ,X m and n(F m ) and T,(F m ) are strongly consistent estimators of 
n(F) and respectively. Hence, 

sup \MHD(x;F m ) - MHD(x;F)\ = \MHD(x m ; F m ) - MHD(x m ;F)\, 

by the continuity of MHD(x; F) and MHD(x; F m ) in x for some x m = 
x(F m ,F) € R d such that ||x m || < M < oo for some M > and all large m. 
Write, for simplicity, /i and £ for n(F) and S(-F) and /i m and S m for fi(F m ) 
and T,(F m ), respectively. Then 

\MHD(x m -F m ) - MHD{x m -F)\ 

_ | (Mm ~ AQ'S'^/Xm + /■*- 2x m ) + (x m - ^'(E" 1 - S" 1 )^ - fl)\ 

(1 + ||S m 1/2 (x m - /i m )|| 2 )(l + ||E-V2(x ra - A^) || 2 ) 

This, in conjunction with the strong consistency of // m and S m , yields (A2). 
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Holder's inequality and expectations of quadratic forms (page 13 of [21]) 
yield (A3) if conditions (i) and (ii) below are met. (A4) holds trivially if (iii) 
holds. 

(i) fi m and T, m are strongly consistent estimators of fj, and E, respec- 
tively; 

(ii) E{ixm - \t)i = 0(m-V 2 ), ^(E- 1 -S- 1 ) iJfe = 0(m- 1 /2) j l<i,j,k< 
d, where the subscripts % and jk denote the elements of a vector and a 
matrix, respectively; 

(iii) The probability mass of X at any ellipsoid is 0. 

Corollary 1. Assume that conditions (i), (ii) and (iii) hold and the 
distribution of MHD(Y; F) is Lipschitz continuous. Then Theorem 1 holds 
for MRU. 

Example 2 [Halfspace depth (HD)] . Tukey [24] suggested this depth as 

HD(x]F) = inf{P(H x ) : H x closed halfspace with x on its boundary}, 

x£R d , 

where P is the probability measure corresponding to F. (A2) follows im- 
mediately (see, e.g., pages 1816-1817 of [5]). Let H be the set of all closed 
halfspaces and P m be the empirical probability measure of P. Define 

D m {H) :=m 1 / 2 ||P m - P\\ H ■- SU p m^P^H) - P(H)\. 

Note that TL is a permissible class of sets with polynomial discrimination 
(see Section II. 4 of [18] for definitions and arguments). Let SiTL) be the 
degree of the corresponding polynomial. Then, by a generalized Dvoretzk- 
Kiefer-Wolfowitz theorem (see [1, 13, 14] and Section 6.5 of [6]), we have, 
for any e > 0, that P{D m {H) > M) < Ke^ 2 -^ 2 for some sufficiently large 
constant K = K (e, S(H.)). This immediately yields (A3). 

To verify ( A4) , we consider the case F = G for simplicity. We first note 
that HD(X;F m ) for given F m is discrete and can take at most 0{m) val- 
ues Cj = i/m for i = 0, 1, . . . ,m. Let F be continuous. We first consider the 
univariate case. Let 

Aq = M 1 — P)i? m , Ai = P|iT m _j + i — P|il m _j, 
A fc+1 = f|fl m _ fc -f|0, 
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with 1 < i < k and k = \ [m — l)/2j , where Hi is any closed half-line con- 
taining exactly some i points of X±, . . . , X m . It follows that for < i < k, 

P(HD(X;F m ) = Ci \F m ) 

= P(Ai) = [F(X (i+1) ) - F(X (i) )] + [F(X (m _ i+1) ) - F(X (m _ )] 
P(HD(X;F m ) = c k+1 | F m ) = P(A k+1 ) = [F(X {m _ k) ) - F(X {k+1) )}, 

where — oo =: Xr \ < Xry <■•■ < Xr m \ < X( m+1 ) := oo are order statistics. 

On the other hand, Xu\ and i ?_1 (C/( i )) are equal in distribution (=), where 

=: U( \ < Um < ■ • ■ < U( m \ < t/( m +i) := 1 are order statistics based on a 
sample from the uniform distribution on [0, 1]. Let Di = F(X^ i+ ^) —F(X^), 

1 = 0, . . . , m. The D[s have the same distribution and 

^(A) = -At . E ( D i) 



m+l' v l ' (m+l)(m + 2)' 

EiDiDj) 



(m + l)(m + 2) ' 

Hence, for < i < k, E((P(HD(X; F m ) = a \ F m )) 2 ) = 6/((m + l)(m + 2)) 
and E((P(HD(X;F m ) = c k+1 \ F m )) 2 ) = 0(m- 2 ). Thus, (A4) follows imme- 
diately. 

Let us now treat the multivariate case. Let Xi,... ,X m be given. Denote 
by H{ any closed halfspace containing exactly i points of X\ , X m . Define 
the sets 

A® = M d — f~]H m , A\ = P| H m — P| i? m -i 

A m -k = Pi Hk+i — Pi H k , A m _ k+ i = pi H k , 

with (m — k + l)/m = max^gjgd HD(x; F m ) < 1. Then it is not difficult to see 
that 

HD{x; F m ) = i/m, for x£Ai, i = 0, 1, . . . , m — k, m — k + 1. 

Now let pi = P{HD{X] F m ) = Ci \ F m ) with Cj = i/m. Then, for any < i < 
m - k + 1, pi = P(X G Ai) = P{C\H m _ i+l ) - P(C\H m -i) with H m+1 = R d 
and H k _i = 0. Now, treating pi as random variables based on the random 
variables X±, . . . ,X m , by symmetry and the uniform spacings results used 
for the univariate case above, we conclude that the p^s have the same dis- 
tribution for i = 0, . . . , m — k and 

E(pi) = 0(m~ 1 ), E(p 2 ) = 0(m- 2 ), i = 0, . . . ,m - k + 1. 

Assumption (A4) follows in a straightforward fashion. Thus, we have 
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Corollary 2. Assume that F is continuous and the distribution of 
HD(Y;F) is Lipschitz continuous. Then Theorem 1 holds true for HD . 

Example 3 [Projection depth (PD)]. Stahel [22] and Donoho [4] defined 
the outlyingness of a point x 6 W 1 with respect to F in W 1 as 

0(x;F) = sup \u'x-Li(F u )\/a(F u ), 

where S d ~ l = {u: \\u\\ = 1}, /i(-) and er(-) are univariate location and scale 
estimators such that \x{aZ + b) = afi(Z) + b and a(aZ + b) = |a|o"(Z) for any 
scalars a, b 6 M 1 and random variable 2 s! 1 and u'X ~ -F u with X ~ F. 
The projection depth of x with respect to F is then defined as (see [10, 25]) 

PD(x;F) = l/(l + 0(x;F)). 

Under the following conditions on fx and a, 

(CI) sup Mg5 d-i n(F u ) < co, < inf ue5 d-i cx(F u ) < sup ue5 d-i a(F u ) < co; 
(C2) sup ueSd -i\ij,(F mu )-n(F u )\ =o(l), sup ug5 d-i|cr(i ? mu )-a(F u )| =o(l), a.s. 
(C3) Ssup|| u || =1 |/i(F mit )-^(F u )| = 0(m^ 1 /2) ) Esu P || u || =1 |a(F mw ) -<x(F„)| = 

©(yn^ 1 / 2 ), 

where is the empirical distribution based on u'Xi , . . . , u'X m and X\ , . . . , X, 
is a sample from F, Assumption (A2) holds true by Theorem 2.3 of [25] and 
(A3) follows from (C3) and the fact that for any x £ M. d and some constant 
C>0, 

\PD{x-F m )-PD{x-F)\ 

, Q{x;F)\a(F mu ) - a{F u )\ + \fi(F mu ) - fi(F u )\ 
~ uS*-i (1 + 0{x- F m ))(l + 0(x; F))a{F mu ) 

<C sup {\a(F mu )-a(F u )\ + \/j(F mu )-fi(F u )\}. 

«G5 d - 1 

(C1)-(C3) is true for general smooth M-estimators of /i and a (see [8]) and 
rather general distribution functions F. If we consider the median (Med) and 
the median absolute deviation (MAD), then (C3) holds under the following 
condition: 

(C4) F u has a continuous density f u around points (J,(F U ) +{0, ±cr(F u )} 
such that 

inf f u (KFu))>0, 

\\u\\=l 

n inf (fMF u ) + a(F u )) + f u (fi(F u ) - a(F u ))) > 0. 

kt =1 
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To verify this, it suffices to establish just the first part of (C3) for \i = Med. 
Observe that 

F-\l/2 - \\F mu - F u ||oo) - F~\l/2) < n(F mu ) - fi(F u ) 

^ F u (1/2 + ||-Pj7j U Fulloo) 

for any u and sufficiently large m. Hence, 
\fJ-{F mu ) — n(F u )\ <2\\F mu — FuWoo/ inf f u (l^(F u )) := C \\F mu — F u 

by (C4). Clearly, fi(F mu ) is continuous in u. From (C4), together with and 
Lemma 5.1 and Theorem 3.3 of [25], it follows that /J>{F U ) is also continuous 
in u. Therefore, 

P(v^ sup \ii{F mu )-ix{F u )\>t) <P(||F mit0 -F U0 || oo >(t 2 /(^C' 2 )) 1/2 ) 

< 2e -2i 2 /C 2 ) foranyt>0, 

where the unit vector uo may depend on m. Hence, the first part of (C3) 
follows. 

Assumption (A4) holds for PD since P(PD(X; F m ) = c \ F m ) = for most 
commonly used (/j,, a) and F. First, the continuity of fi(F mu ) and cr(F mu ) 
in u gives 

P(PD(X; F m ) = c\ F m ) = P((u' x X - fi(F mUx ))/a(F mUx ) = (1 - c)/c | F m ) 

for some unit vector ux depending on X. This probability is for most F 
and (fJ,,cr). For example, if (//,<r) = (mean, standard deviation), then 

P(PD(X; F m ) = c | F m ) = P(\\S m l ' 2 (X - X m )\\ = (1 - c)/c | F m ), 

where S m = ——^ J2iLii^i ~ X m )(Xi — X m )' , which is provided the mass 
of F on any ellipsoid is 0. Thus, 

Corollary 3. Assume that (C1)-(C3) hold, P(PD(X; F m ) = c \ F m ) = 
for any c > and PD(Y;F) satisfies (Al). Then Theorem 1 holds for PD. 
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5. Power properties of the Liu Singh multivariate rank sum test. 

Large sample properties. A major application of the Liu-Singh multi- 
variate rank-sum statistic is to test the following hypotheses: 

(5.5) H :F = G versus H X :F±G. 

By Theorem 1 , a large sample test based on the Liu-Singh rank-sum statistic 
Q(F m ,G n ) rejects Hq at (an asymptotic) significance level a when 

(5.6) \Q(F m , G n ) - 1/2| > «i_ a/3 ((l/m + l/n)/12) 1/2 , 

where $(z r ) = r for < r < 1 and normal cumulative distribution function 
$(•). The test is affine invariant and is distribution-free in the asymptotic 
sense under the null hypothesis. Here, we focus on the asymptotic power 
properties of the test. By Theorem 1, the (asymptotic) power function of 
the depth-based rank-sum test with an asymptotic significance level a is 

p ( F G) = l $ ( 1/2 ~ G) + (V™ + 1 M/ 12 \ 

\ Ja 2 GF /m + a 2 FG /n J 

(5-7) 

' 1/2 - Q(F, G) - Zl _ a/2 V(l/m + l/n)/12\ 



a 2 GF /m + a FG / 



n 



The asymptotic power function indicates that the test is consistent for 
all alternative distributions G such that Q(F,G) 7^ 1/2. Before studying 
the behavior of (3q(F,G), we shall consider its key component Q(F,G), the 
so-called quality index in [12]. For convenience, consider a normal family 
and let d = 2. Assume, without loss of generality, that F = ^((0, 0)', I2) 
and consider G = A^/^, £), where -Z2 is the 2x2 identity matrix. It can be 
shown that 

Q(F,G) = (|5|/|E|) 1 / 2 exp(- y u / (S- 1 - S" x 5 X" 1 )^) 

for any affine invariant depth functions, where S = (I2 + X" 1 )" 1 . In the case 
fi= (u,u)' and S = cr 2 /2 5 write Q(u,a 2 ) for Q(F,G). Then 

Q(«, <r 2 ) := Q(F, G) = exp(-u 2 /(l + a 2 ))/(l + a 2 ). 

Its behavior is revealed in Figure 1. It increases to its maximum value (1 + 
cr 2 ) -1 [or exp(— u 2 )] as u — > for a fixed a 2 (or as cr 2 — > for a fixed u). 
When u = and cr 2 = 1, Q(F,G), as expected, is 1/2, and it is less than 
1/2 when there is a dilution in the distribution (a 2 > 1). Note that Liu and 
Singh [12] also discussed Q(u,a 2 ). The results here are more accurate than 
their Table 1. 
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Fig. 1. The behavior of Q(F,G) with F = JV 2 ((0, 0)',i 2 ) and G = N 2 ((u,u)' ,a 2 1 2 ) . 

A popular large sample test for hypotheses (5.5) is based on Hotelling's 
T 2 statistic that rejects Hq if 

(5.8) (X m - Y n )'((l/m + l/n)S pooled r\X m - Y n ) > x?_ a (d), 

where S poo i ed = ((m - + (n - l)S Y )/(m + n - 2), I m , F„, Sx and Sy 
are sample means and covariance matrices and Xr(^) i s the rth quantile of 
the chi-square distribution with d degrees of freedom. The power function 
of the test is 

p T 2(F, G) = P((X m - Y n )'((l/m + l/n)S pooled y l (X m - Y n ) 

(5.9) 

>xl- a (d)). 

We also consider a multivariate rank-sum test based on the Oja objective 
function in [7]. The Oja test statistic, O, has the following null distribution 
with N = m + n and A = n/N: 

(5.10) O := (N\(l - X)y 1 T' N BJ f 1 T]sr ^ X \d). 

Here Tat = J2k=i akRN(zk), Rn(z) = rf!( ^7 E pg p S p (z)n p , a k = (l-X)I(k > 
m)-\I{k <m),z k £ {X 1 ,. ..,X m ,Yi,.. .,Y n }, B N = j^iJ2k R N(z k )R' N (z k ), 
P = {p= (i u ...,i d ) : 1 <ix < ••• <i d < N}, S p (z) = sign(n 0p + z'n p ) and 

,/l 1 ... 1 1 \ , 

det = n 0p + z n p , 

\z tl z t2 ... z ld ZJ 
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where uq p and nj p ,j = 1,. .. ,d, are the cofactors according to the column 
(1, z')' . The power function of this rank test with an asymptotic significance 
level a is 

(5.11) fo(F, G) = P((NX(1 - X^-'t^B^Tn > X \-M)- 

The asymptotic relative efficiency (ARE) of this test in Pitman's sense is 
discussed in the literature; see, for example, [16]. At the bivariate normal 
model, it is 0.937 relative to T 2 . 

In the following, we study the behavior of (3q, (3o and (3t 2 - To facilitate 
our discussion, we assume that a = 0.05, m = n, d = 2 and that G is normal 
or mixed (contaminated) normal, shrinking to the null distribution F = 
N 2 ((Q,0)' ,I 2 ). Note that the asymptotic power of the depth-based rank-sum 
test, hereafter called the Q test, is invariant with respect to the choice of 
the depth function. 

For pure location shift models Y ~ G = N 2 ((u, u)' , I 2 ), Hotelling's T 2 
based test, hereafter called T 2 is the most powerful, followed by the Oja 
rank test, to be called the O test, and then followed by the Q test. All of 
these tests are consistent at any fixed alternative. Furthermore, we note that 
when the dimension d gets larger, the asymptotic powers of these tests move 
closer. 

On the other hand, for pure scale change models G = N 2 ((0, 0)', o~ 2 I 2 ), the 
Q test is much more powerful than the other test. In fact, for these models, 
the T 2 test has trivial asymptotic power a at all alternatives. Figure 2, a 
plot of the power functions f3 T 2, (3o and (3q, clearly reveals the superiority 
of the Q test. The O test performs just slightly better than T 2 . 

In the following, we consider a location shift with contamination a scale 
change with contamination and a simultaneous location and scale change as 
alternatives. The amount of contamination e is set to be 10%. The asymp- 
totic power calculations for T 2 and Q are based on the limiting distributions 
of the test statistics under the alternatives. Since the limiting distribution is 
not available for the O test (except for pure location shift models), we use 
Monte Carlo to estimate the powers. 

For G=(l- e)N 2 ((u,u)',I 2 ) + eJV a ((0,0)', (1 + 10uo- 2 )/ 2 ), the contam- 
inated location shift models with u > 0, the (asymptotic) power function 
P T 2(F,G) is P(Z2a > Xo.95(2)), where Z 2a has a noncentral chi-square dis- 
tribution with two degrees of freedom and noncentrality parameter n(l — 
e) 2 ti 2 /(l + heua 2 + e(l — e)u 2 ). Since the derivation of this result is quite 
tedious, we omit the details. Comparisons of (3^2, 0o and (3q are listed in 
Table 1, which clearly reveals that T 2 becomes less powerful than Q when 
a pure location shift model is 10% contaminated. For large fi, O is more 
powerful than Q since the underlying model is mainly a location shift. 

For G = 0.9A r 2 ((0,0) / ,cr 2 / 2 ) + 0.1N 2 ((u,u)' ,I 2 ), the contaminated scale 
change models with a = u + 1 > 1, the (asymptotic) power function f3 T2 is 
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Fig. 2. (3 T2 {F,G) and Pq{F,G) with F = N 2 ((0,0)' ,h), G = JV 2 ((0,0)', cr 2 / 2 ). 



Table 1 

TTie (asymptotic) power of tests based on T 2 , O and Q. 



u 




0.0 


0.15 


0.20 


0.25 


0.30 


0.35 




G = 


0.9JV2 ((«,«) 


',/ 2 )+0.1A r 2 ((0,0)',(l + 10M ( 7 2 )72), 


a = 4 




n= 100 




0.050 


0.117 


0.155 


0.196 


0.239 


0.284 






0.051 


0.245 


0.286 


0.307 


0.381 


0.443 




Bo 


0.046 


0.157 


0.296 


0.423 


0.558 


0.687 


n = 200 




0.050 


0.193 


0.273 


0.357 


0.441 


0.521 




0Q 


0.051 


0.430 


0.508 


0.549 


0.659 


0.746 




Po 


0.056 


0.342 


0.546 


0.712 


0.881 


0.941 



equal to P(Z2b > Xo.9s(2))> where Z 2 b has the noncentral chi-square distribu- 
tion with two degrees of freedom and noncentrality parameter 2ns 2 u 2 /(l + 
e+ (1 — e)a 2 + 2e(l — e)u 2 ). Table 2 reveals the superiority of Q in detecting 
scale changes over T 2 and O, even when the model has a 10% contamination. 

For G = N2((u, it)', cP'l'i)^ the simultaneous location and scale change 
models with a = u + 1 > 1, the (asymptotic) power function (3 T 2 is P(Z2 C > 
Xo.95(2)), where Z 2c has the noncentral chi-square distribution with two de- 
grees of freedom and noncentrality parameter 2nu 2 j '(1 + a 2 ) . Table 3 reveals 
that Q can be more powerful than T 2 and O when there are simultaneous 
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location and scale changes. Here, we selected (a — l)/u = 1. Our empirical 
evidence indicates that the superiority of Q holds provided that (cr — l)/u is 
close to or greater than 1, that is, as long as the change in scale is not much 
less than that in location. Also note that in this model, T 2 is more powerful 
than O. 

Small sample properties. To check the small sample power behavior of Q, 
we now examine the empirical behavior of the test based on Q(F m ,G n ) and 
compare it with those of T 2 and O. We focus on the relative frequencies of 
rejecting Hq of (5.5) at a = 0.05 based on the tests (5.6), (5.8) and (5.10) and 
1000 samples from F and G at the sample size m = n = 25. The projection 
depth with (/i, a) = (Med, MAD) is selected in our simulation studies and 
some results are given in Table 4. Again, we skip the pure location shift and 
scale change models, in which cases, T 2 and Q perform best, respectively. 
Our Monte Carlo studies confirm the validity of the (asymptotic) power 
properties of Q at small samples. 



Table 2 

The (asymptotic) power of tests based on T 2 , O and Q. 







1.0 


1.2 


1.4 


1.6 


1.8 


2.0 






G = 0.9N 2 {(0,0)',a 2 I 2 ) 


+0.1iV 2 (( u , 


u)',h), 


u = a— 1 




n= 100 


P T 2 


0.050 


0.050 


0.052 


0.054 


0.056 


0.059 




Pq 


0.051 


0.181 


0.430 


0.734 


0.891 


0.963 




Po 


0.048 


0.054 


0.057 


0.064 


0.068 


0.070 


n = 200 


P T 2 


0.050 


0.051 


0.054 


0.058 


0.063 


0.068 




Pq 


0.051 


0.299 


0.740 


0.950 


0.994 


1.000 




Po 


0.052 


0.059 


0.063 


0.085 


0.112 


0.139 



Table 3 

The (asymptotic) power of tests based on T 2 , Q and O. 



u 




0.0 


0.15 


0.20 


0.25 


0.30 


0.35 






G 


= N 2 ((u,u) 


',a 2 I 2 ), a 


= u + l 






n= 100 


(3 T 2 


0.050 


0.219 


0.348 


0.493 


0.634 


0.755 




0Q 


0.051 


0.437 


0.662 


0.839 


0.941 


0.983 




(3o 


0.046 


0.218 


0.324 


0.430 


0.573 


0.708 


n = 200 




0.050 


0.404 


0.625 


0.805 


0.916 


0.970 






0.049 


0.725 


0.922 


0.987 


0.999 


1.000 




Po 


0.056 


0.357 


0.569 


0.755 


0.882 


0.944 
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Table 4 

Observed relative frequency of rejecting Ho . 





u 




0.0 


0.15 


0.20 


0.25 


U.oU 


U.oD 






G 


= 0.9N 2 {(u,u 


l',/2)+0.1JV2((0,0)',(l + 10M<T 2 )/ 2 ), 


a = 4 




n 


= 25 




0.058 


0.083 


0.108 


0.142 


0.151 


0.189 






0Q 


0.057 


0.154 


0.156 


0.170 


0.203 


0.216 






Po 


0.047 


0.084 


0.116 


0.152 


0.201 


0.254 




a 2 




1.0 


1.2 


1.4 


1.6 


1.8 


2.0 








G = 0.9A^2((0,0)',o- 2 / 2 )4 


■0.1N 2 ((u,u] 


\',h), u = cr 


- 1 




n 


= 25 


P T 2 


0.059 


0.063 


0.059 


0.073 


0.061 


0.067 






0Q 


0.063 


0.145 


0.243 


0.377 


0.469 


0.581 






Po 


0.051 


0.058 


0.041 


0.053 


0.043 


0.055 




u 




0.0 


0.15 


0.20 


0.25 


0.30 


0.35 








G 


= N 2 ((u,u)' 




= u + l 






n 


= 25 


P T 2 


0.069 


0.113 


0.147 


0.183 


0.220 


0.269 






Pq 


0.060 


0.245 


0.324 


0.418 


0.498 


0.587 






Po 


0.044 


0.082 


0.089 


0.127 


0.197 


0.221 



6. Concluding remarks. This paper proves the conjectured limiting dis- 
tribution of the Liu-Singh multivariate rank-sum statistic under some reg- 
ularity conditions which are verified for several commonly used depth func- 
tions. The asymptotic results in the paper are established for general depth 
structures and for general distributions F and G. The Q test requires nei- 
ther the existence of a covariance matrix nor the symmetry of F and G. 
This is not always the case for Hotelling's T 2 test and other multivariate 
generalizations of Wilcoxon's rank-sum test. 

The paper also studies the power behavior of the rank-sum test both 
asymptotically and empirically. Although the discussion focuses on the nor- 
mal and mixed normal models and d = 2, what we learned from these in- 
vestigations is typical for d > 2 and for many non-Gaussian models. Our 
investigations also indicate that the conclusions drawn from our two-sample 
problems are valid for one-sample problems. 

The Liu-Singh rank-sum statistic plays an important role in detecting 
scale changes similarly to the role played by Hotelling's T 2 in detecting 
location shifts of distributions. When there is a scale change in F, the depths 
of almost all points y from G decrease or increase together and, consequently, 
Q(F,G) is very sensitive to the change. This explains why Q is so powerful 
in detecting small scale changes. On the other hand, when there is a small 
shift in location, the depths of some points y from G increase, whereas those 
of the others decrease and, consequently, Q(F,G) will not be so sensitive to 
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a small shift in location. Unlike the T 2 test for scale change alternatives, the 
Q test is consistent for location shift alternatives, nevertheless. 

Finally, we briefly address the computing issue. Hotelling's T 2 statistic is 
clearly the easiest to compute. The computational complexity for the O test 
is 0(n d+1 ), while the complexity for the Q test based on the projection depth 
(PD) is 0{n d+2 ). Indeed, the projection depth can be computed exactly by 
considering 0{n d ) directions that are perpendicular to a hyperplane deter- 
mined by d data points; see [26] for a related discussion. The exact computa- 
tion is of course time-consuming. In our simulation study, we employed ap- 
proximate algorithms (see http://www.stt.msu.edu/~zuo/table4.txt), 
which consider a large number of directions in computing Q and O. 
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